Fix parallel tool call limit enforcement #2978

tradeqvest · 2025-09-22T08:38:08Z

Fix parallel tool call limit enforcement

Problem

The tool_calls_limit in UsageLimits was not properly enforced for parallel tool execution. When multiple tools were called in parallel, all tools would start executing before the limit was checked, allowing the limit to be exceeded before raising UsageLimitExceeded.

For example, if tool_calls_limit=6 and the model returned 8 parallel tool calls, all 8 tools would start executing before the error was raised.
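The failure mode can be reproduced with a toy model. This is a sketch, not the library's code: `run_batch_with_late_check` and `call_tool` are hypothetical names, the local `UsageLimitExceeded` class is a stand-in for the real exception, and the sketch assumes the count is checked inside each task only after that task has begun.

```python
import asyncio


class UsageLimitExceeded(Exception):
    """Stand-in for the library's exception of the same name."""


async def run_batch_with_late_check(n_calls: int, limit: int) -> list[int]:
    """Start every call as a task; count and check only as each one finishes."""
    started: list[int] = []
    used = 0

    async def call_tool(i: int) -> None:
        nonlocal used
        started.append(i)       # a real tool's side effects would begin here
        await asyncio.sleep(0)  # yield control, as a tool doing I/O would
        used += 1
        if used > limit:
            raise UsageLimitExceeded(f"tool_calls_limit of {limit} exceeded")

    tasks = [asyncio.create_task(call_tool(i)) for i in range(n_calls)]
    try:
        await asyncio.gather(*tasks)
    except UsageLimitExceeded:
        pass  # the error surfaces only after every task has already started
    return started


started = asyncio.run(run_batch_with_late_check(n_calls=8, limit=6))
print(len(started))
```

With 8 calls and a limit of 6, all 8 indices end up in `started`, because every task runs up to its first `await` before any check can fire.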

Solution

This PR modifies the parallel tool execution logic in _agent_graph.py to enforce the limit before starting tool tasks:

- Pre-execution limit check: before creating async tasks for parallel tools, we now check how many tool calls remain within the limit.
- Immediate error: raise UsageLimitExceeded if the number of requested tool calls would exceed the usage limit.

DouweM · 2025-09-26T22:03:41Z

@tradeqvest Since we'll raise an error regardless, is it worth executing tool calls up to the limit, instead of just raising immediately?

I was thinking we could do something like this:

pydantic-ai/pydantic_ai_slim/pydantic_ai/_agent_graph.py

Lines 476 to 484 in bfcccba

    
    usage = ctx.state.usage
    if ctx.deps.usage_limits.count_tokens_before_request:
        # Copy to avoid modifying the original usage object with the counted usage
        usage = deepcopy(usage)
        counted_usage = await ctx.deps.model.count_tokens(message_history, model_settings, model_request_parameters)
        usage.incr(counted_usage)
    ctx.deps.usage_limits.check_before_request(usage)

Where we optimistically increment a copied version of the usage, and check the usage limit against that.
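Applied to tool calls, the same copy-then-check pattern might look like the sketch below. `ToyUsage` and `check_projected_tool_calls` are hypothetical, simplified stand-ins and not the library's `usage` classes; the local `UsageLimitExceeded` is likewise only a placeholder.

```python
from copy import deepcopy
from dataclasses import dataclass
from typing import Optional


class UsageLimitExceeded(Exception):
    """Stand-in for the library's exception of the same name."""


@dataclass
class ToyUsage:
    """Hypothetical, simplified usage counter (not the library's class)."""

    tool_calls: int = 0


def check_projected_tool_calls(usage: ToyUsage, pending_calls: int, limit: Optional[int]) -> None:
    """Raise if running `pending_calls` more tools would exceed `limit`."""
    projected = deepcopy(usage)  # copy so the real counter is left untouched
    projected.tool_calls += pending_calls
    if limit is not None and projected.tool_calls > limit:
        raise UsageLimitExceeded(
            f"{projected.tool_calls} tool calls would exceed the limit of {limit}"
        )


usage = ToyUsage(tool_calls=2)
check_projected_tool_calls(usage, pending_calls=4, limit=6)  # 2 + 4 == 6: allowed

try:
    check_projected_tool_calls(usage, pending_calls=5, limit=6)  # 2 + 5 == 7: rejected
    rejected = False
except UsageLimitExceeded:
    rejected = True
```

Because the increment happens on a copy, `usage.tool_calls` is still 2 after the rejected check.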

tradeqvest · 2025-09-26T22:56:48Z

@DouweM It would definitely be simpler, but I was thinking that the tool output produced before the usage limit violation could still be valuable to capture and process further. Let me know what you think.

DouweM · 2025-09-29T23:25:09Z

@tradeqvest I think it'd be misleading if those results never get sent back to the model to use, and the user will think their action failed even though some tools (with side effects) may have in fact been executed. If we had a way to, instead of failing hard, tell the model "this call was not executed because you hit the limit" for the calls over the limit, executing the earlier ones makes sense, but until we have such a mode I'd rather not run the tools at all.
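For illustration only, the softer mode described above (which this PR does not implement) could be sketched roughly as follows. `ToolCall`, `split_at_limit`, and `SKIPPED_MESSAGE` are hypothetical names invented for this sketch.

```python
from dataclasses import dataclass


@dataclass
class ToolCall:
    """Hypothetical stand-in for a tool call requested by the model."""

    tool_name: str
    tool_call_id: str


SKIPPED_MESSAGE = "This call was not executed because the tool call limit was reached."


def split_at_limit(
    calls: list[ToolCall], used: int, limit: int
) -> tuple[list[ToolCall], dict[str, str]]:
    """Split a batch into calls to run and per-call messages for skipped calls."""
    remaining = max(limit - used, 0)
    to_run = calls[:remaining]
    # Each skipped call gets a message that could be returned to the model.
    skipped = {call.tool_call_id: SKIPPED_MESSAGE for call in calls[remaining:]}
    return to_run, skipped


calls = [ToolCall(f"tool_{i}", f"call_{i}") for i in range(8)]
to_run, skipped = split_at_limit(calls, used=0, limit=6)
```

Here the first six calls would run and the last two would each receive the skip message instead of raising.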

tradeqvest · 2025-10-02T08:24:57Z

@DouweM Good point! I've adapted the implementation to use the fail-fast approach you suggested.
Now it checks upfront whether the projected total would exceed the limit and raises immediately without executing any tools from that batch. Thanks for the feedback!
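A minimal sketch of the fail-fast behaviour, under the assumption that the check compares the projected total (calls already used plus calls in the batch) against the limit before any task is created. `run_batch_fail_fast` and `call_tool` are hypothetical names and the exception class is a local stand-in; the real change lives in `_agent_graph.py`.

```python
import asyncio


class UsageLimitExceeded(Exception):
    """Stand-in for the library's exception of the same name."""


async def run_batch_fail_fast(n_calls: int, used: int, limit: int, started: list[int]) -> None:
    """Reject the whole batch up front if it would push usage past the limit."""
    projected = used + n_calls
    if projected > limit:
        # Raised before any task exists, so no tool has side effects.
        raise UsageLimitExceeded(
            f"{projected} tool calls would exceed the tool_calls_limit of {limit}"
        )

    async def call_tool(i: int) -> None:
        started.append(i)
        await asyncio.sleep(0)

    await asyncio.gather(*(call_tool(i) for i in range(n_calls)))


rejected_batch: list[int] = []
try:
    asyncio.run(run_batch_fail_fast(n_calls=8, used=0, limit=6, started=rejected_batch))
    raised = False
except UsageLimitExceeded:
    raised = True

accepted_batch: list[int] = []
asyncio.run(run_batch_fail_fast(n_calls=6, used=0, limit=6, started=accepted_batch))
```

The batch of 8 raises with `rejected_batch` still empty, while the batch of 6 fits the limit exactly and runs in full.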

- Inline limit check in _call_tools instead of separate function
- Pass usage directly as parameter rather than extracting from tool_manager
- Remove redundant per-tool check in ToolManager
- Align error message format with other usage limit errors

DouweM · 2025-10-03T22:16:56Z

@tradeqvest Thanks Niko!

tradeqvest force-pushed the fix-tool-call-limit branch from c0165b0 to 5280163 September 22, 2025 08:48

tradeqvest marked this pull request as ready for review September 22, 2025 13:10

DouweM self-assigned this Sep 26, 2025

DouweM added the awaiting author revision label Sep 26, 2025

tradeqvest force-pushed the fix-tool-call-limit branch 3 times, most recently from 9a459ab to f053b90 October 2, 2025 08:01

DouweM requested changes Oct 3, 2025

tradeqvest added 5 commits October 3, 2025 22:40

fix: enforce tool call limit enforcement for parallel tool calls

5c0ba65

refactor: remove unused variable

03766ed

- Removed the unused `parts` variable in `UserPromptNode`.

refactor: adapt logic to check projected tool calls against limits be…

5e6540e

…fore execution

refactor: remove usage_limits parameter from _call_tool

6018ae1

- Simplified the tool call logic by removing the unused usage_limits parameter from the _call_tool method in ToolManager.

tradeqvest force-pushed the fix-tool-call-limit branch from 127eee7 to 6018ae1 October 3, 2025 20:40

refactor: add usage limits in retry test

b40a2cd

tradeqvest requested a review from DouweM October 3, 2025 20:54

tradeqvest changed the title ~~fix: enforce tool call limit enforcement for parallel tool calls~~ fix: enforce tool call limits for parallel tool calls Oct 3, 2025

DouweM requested changes Oct 3, 2025

tradeqvest added 2 commits October 3, 2025 23:42

docs: revert accidental deletion of heading

4c27a90

refactor: simplify usage counting for function tools

372c04c

DouweM changed the title ~~fix: enforce tool call limits for parallel tool calls~~ Fix parallel tool call limit enforcement Oct 3, 2025

DouweM enabled auto-merge (squash) October 3, 2025 22:06

tradeqvest requested a review from DouweM October 3, 2025 22:07

DouweM merged commit 3b8ff2c into pydantic:main Oct 3, 2025
29 checks passed
